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Abstract 



This paper provides an expository introduction to monotone and near-monotone biochemical network 
structures. Monotone systems respond in a predictable fashion to perturbations, and have very robust 
dynamical characteristics. This makes them reliable components of more complex networks, and suggests 
that natural biological systems may have evolved to be, if not monotone, at least close to monotone. In 
addition, interconnections of monotone systems may be fruitfully analyzed using tools from control theory. 

> 

£5 ! 1 Introduction 

(N ; 

■ In cells, biochemical networks consisting of proteins, RNA, DNA, metabolites, and other species, are responsi- 

■ ble for control and signaling in development, regulation, and metabolism, by processing environmental signals, 
q | sequencing internal events such as gene expression, and producing appropriate cellular responses. The field of 

systems molecular biology is largely concerned with the study of such networks. Often, as in control theory, 
biochemical networks are viewed as interconnections of simpler subsystems. This paper discusses recent work 
which makes use of topology (graph structure) as well as sign information regarding subsystems and their 
interconnection structure in order to infer properties of the complete system. 

It is broadly appreciated that behavior is critically dependent on network topology as well as on the signs 
(activating or inhibiting) of the underlying feedforward and feedback interconnections [1—11]. For example, 
Figures QJa-c) show the three possible types of feedback loops that involve two interacting chemicals,. A 






B 



Figure 1: (a) Mutual activation. (b) Mutual inhibition. (c) Activation-inhibition 




mutual activation configuration is shown in Figure Uta) : a positive change in A results in a positive change in 
B, and vice- versa. Configurations like these are associated to signal amplification and production of switch- 
like biochemical responses. A mutual inhibition configuration is shown in Figure [Hb): a positive change in A 
results in repression of B, and repression of B in turn enhances A. Such configurations allow systems to exhibit 
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multiple discrete, alternative stable steady-states, thus providing a mechanism for memory. Both (a) and (b) 
are examples of positive-feedback systems [12-19]. On the other hand, activation-inhibition configurations like 
in Figure [U^c) are necessary for the generation of periodic behaviors such as circadian rhythms or cell cycle 
oscillations, by themselves or in combination with multi-stable positive-feedback subsystems, as well as for 
adaptation, disturbance rejection, and tight regulation (homeostasis) of physiological variables [7,9,11,20-28]. 
Compared to positive-feedback systems, negative-feedback systems are not "consistent," in a sense to be 
made precise below but roughly meaning that different paths between any two nodes should reinforce, rather 
than contradict, each other. For (c), a positive change in A will be resisted by the system through the 
feedback loop. Consistency, or lack thereof, also plays a role in the behavior of graphs without feedback; for 
example [10,29,30] deal with the different signal processing capabilities of consistent ("coherent") compared 
to inconsistent feedforward motifs. 

A key role in the work to be discussed here will be played by consistent systems and subsystems. We will 
discuss the following points: 

• Interesting and nontrivial conclusions can be drawn from (signed) network structure alone. This struc- 
ture is associated to purely stoichiometric information about the system and ignores fluxes. Consistency, 
or close to consistency, is an important property in this regard. 

• Interpreted as dynamical systems, consistent networks define monotone systems, which have highly 
predictable and ordered behavior. 

• It is often useful to analyze larger systems by viewing them as interconnections of a small number 
of monotone subsystems. This allows one to obtain precise bifurcation diagrams without appeal to 
explicit knowledge of fluxes or of kinetic constants and other parameters, using merely "input /output 
characteristics" (steady-state responses or DC gains). The procedure may be viewed as a "model 
reduction" approach in which monotone subsystems are viewed as essentially one-dimensional objects. 

• The possibility of performing a decomposition into a small number of monotone components is closely 
tied to the question of how "near" a system is to being monotone. 

• We argue that systems that are "near monotone" are biologically more desirable than systems that are 
far from being monotone. 

• There are indications that biological networks may be much closer to being monotone than random 
networks that have the same numbers of vertices and of positive and negative edges. 

The need for robust structures and robust analysis tools 

A distinguishing feature of the study of dynamics in cell biology, in contrast to more established areas of applied 
mathematics and engineering, is the high degree of uncertainty inherent in models of cellular biochemical 
networks. This uncertainty is due to environmental fluctuations, and variability among different cells of the 
same type, as well as, from a mathematical analysis perspective, the difficulty of measuring the relevant model 
parameters (kinetic constants, cooperativity indices, and many others) and thus the impossibility of obtaining 
a precise model. Thus, it is imperative to develop tools that are "robust" in the sense of being able to provide 
useful conclusions based largely upon information regarding the qualitative features of the network, and not 
the precise values of parameters or even the forms of reactions. Of course, this goal is often not achievable, 
since dynamical behavior may be subject to phase transitions (bifurcation phenomena) which are critically 
dependent on parameter values. 

Nevertheless, and surprisingly, research by many, notably by Clarke [31], Horn and Jackson [32,33], and 
Feinberg [34-36], relying upon complex balancing and deficiency theory, has resulted in the identification 
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of rich classes of chemical network structures for which such robust analysis is indeed possible. A different 
direction of work, pioneered by Hirsch and Smith [37,38], relies upon the theory of monotone systems, and has 
a similar goal of drawing conclusions about dynamical behavior based only upon structure. This direction has 
been enriched substantially by the introduction of monotone systems with inputs and outputs: as standard 
in control theory [24], one extends the notion of monotone system so as to incorporate input and output 
channels [39]. Once inputs and outputs are introduced, one can study interconnections of systems (Figure [2]), 
and ask what special properties hold if the subsystems are monotone [19,39,40]. 

inputs to subsystem 2 outputs of subs y S tem 2 



input 

to system 




output 
of system 



Figure 2: A system composed of four subsystems 



2 Consistent Graphs, Monotone Systems, and Near-Monotonicity 

We now introduce the basic notions of monotonicity and consistency. The present section deals exclusively 
with graph-theoretic information, which is derived from stoichiometric constraints. Complementary to this 
analysis, bifurcation phenomena can be sometimes analyzed using a combination of these graphical techniques 
together with information on steady-state gains; that subject is discussed in Part II. The discussion is informal; 
in Part II has more rigorous mathematical statements, presented in the more general context of systems with 
external inputs and outputs. 

The systems considered here are described by the evolution of states, which are time-dependent vectors 
x(t) = (xi(t), • • • , x n (t)) whose components Xj represent concentrations of chemical species such as proteins, 
mRNA, or metabolites. In autonomous differential equation ("continuous-time") models, one specifies the 
rate of change of each variable, at any given time, as a function of the concentrations of all the variables at 
that time: 

dxA 

-±{t) = f 1 (x 1 (t),x 2 (t),...,x n (t)) 

^(t) = f2(xi(t),X 2 (t),...,X n (t)) 

dx 

-Jfit) = fn{x 1 (t),X 2 (t),...,X n (t)), 

or just dx/dt = f(x), where / is the vector function with components We assume that the coordinates 
Xi of the state of the system can be arbitrary non-negative numbers. (Constraints among variables can be 
imposed as well, but several aspects of the theory are more subtle in that case.) Often, one starts from a 
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differential equation system written in the following form: 

where R(x) is a (/-dimensional vector of reactions and r is an n x q matrix, called the stoichiometry matrix, 
and either one studies this system directly, or one studies a smaller set of differential equations dx/dt = f(x) 
obtained by eliminating variables through the use of conserved stoichiometric quantities. 

We will mostly discuss differential equation models, but will also make remarks concerning difference equation 
("discrete time") models. The dynamics of these are described by rules that specify the state at some future 
time t = as a function of the state of the system at the present time t k . Thus, the ith coordinate evolves 
according to an update rule: 

Xi(t k +i) = fi(xi{t k ), x 2 {t k ), x n {t k )) 

instead of being described by a differential equation. Usually, t k = &A, where A is a uniform inter-sample 
time. One may associate a difference equation to any given differential equation, through the rule that the 
vector x{t k+ i) should equal the solution of the differential equation when starting at state x(t k ). However, 
not every difference equation arises from a differential equation in this manner. Difference equations may 
be more natural when studying processes in which measurements are made at discrete times, or they might 
provide a macroscopic model of an underlying stochastic process taking place at a faster time scale. 

One may also study more complicated descriptions of dynamics that those given by ordinary differential and 
difference equations; many of the results that we discuss here have close analogs that apply to more general 
classes of (deterministic) dynamical systems, including reaction-diffusion partial differential equations, which 
are used for space-dependent problems with slow diffusion and no mixing, delay-differential systems, which 
help model delays due to transport and other cellular phenomena in which concentrations of one species only 
affect others after a time interval, and integro-differential equations [37,38,41,42]. In a different direction, 
one may consider systems with external inputs and outputs [39]. 



The graph associated to a system 



There are at least two types of graphs that can be naturally associated to a given biochemical network. One 
type, sometimes called the species-reaction graph, is a bipartite graph with nodes for reactions (fluxes) and 
species, which leads to useful analysis techniques based on Petri net theory and graph theory [43-50]. We will 
not discuss species-reaction graphs here. A second type of graph, which we will discuss, is the species graph 
G. It has n nodes (or "vertices"), which we denote by v±, . . . , v n , one node for each species. No edge is drawn 
from node Vj to node Vi if the partial derivative §^(x) vanishes identically, meaning that there is no direct 
effect of the jth species upon the ith species. If this derivative is not identically zero, then there are three 
possibilities: (1) it is > for all x, (2) it is < for all x, or (3) it changes sign depending on the particular 
entries of the concentration vector x. In the first case (activation), we draw an edge labeled +, +1, or just an 
arrow — >. In the second case (repression or inhibition), we draw an edge labeled — , —1, or use the symbol H. 
In the third case, when the sign is ambiguous, we draw both an activating and an inhibiting edge from node 
vj to node i>j. 

For continuous-time systems, no self-edges (edges from a node V{ to itself) are included in the graph G, 
whatever the sign of the diagonal entry dfi/dxi of the Jacobian. For discrete-time systems, on the other 
hand, self-edges are included (we later discuss the reason for these different definitions for differential and 
difference equations). 

When working with graphs, it is more convenient (though not strictly necessary) to consider only graphs G 
that have no multiple edges from one node to another (third case above) . One may always assume that G has 
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this property, by means of the following trick: whenever there are two edges, we replace one of them by an 
indirect link involving a new node; see Figj3j Introducing such additional nodes if required, we will suppose 




Figure 3: Replacing direct inconsistent effects by adding a node 
from now on that no multiple edges exist. 

Although adding new edges as explained above is a purely formal construction with graphs, it may be explained 
biologically as follows. Often, ambiguous signs in Jacobians reflect heterogeneous mechanisms. For example, 
take the case where protein A enhances the transcription rate of gene B if present at high concentrations, 
but represses B if its concentration is lower than some threshold. Further study of the chemical mechanism 
might well reveal the existence of, for example, a homodimer that is responsible for this ambiguous effect. 
Mathematically, the rate of transcription of B might be given algebraically by the formula A^a 2 — k±a, where a 
denotes the concentration of A. Introducing a new species C to represent the homodimer, we may rewrite this 
rate as h,2C — k\a, where c is the concentration of C, plus an new equation like dc/dt = k^a 2 — k^c representing 
the formation of the dimer and its degradation. This is exactly the situation in Figj3j 

Spin assignments and consistency 

A spin assignment £ for the graph G is an assignment, to each node Vi, of a number crj equal to or "—1" 
(a "spin," to borrow from statistical mechanics terminology). In graphical depictions, we draw up-arrows or 
down-arrows to indicate spins. If there is an edge from node Vj to node Vi, with label J{j £ {±1}, we say that 
this edge is consistent with the spin assignment £ provided that: 

JijO-iUj = 1 

which is the same as saying that J™ = <7j<7j, or that <7j = JijCj. An equivalent formalism is that in which 
edges are labeled by "0" or "1," instead of 1 and —1 respectively, and edge labels Jij belong to the set {0, 1}, 
in which case consistency is the property that Jy © Oi © Oj = (sum modulo two). 

We will say that £ is a consistent spin assignment for the graph G (or simply that G is consistent) if every 
edge of G is consistent with £. In other words, for any pair of vertices Vi and Vj, if there is a positive edge 
from node Vj to node Vi, then Vj and Vi must have the same spin, and if there is a negative edge connecting 
Vj to Vi, then Vj and Vi must have opposite spins. (If there is no edge from Vj to Vi, this requirement imposes 
no restriction on their spins.) 

In order to decide whether a graph admits any consistent spin assignment, it is not necessary to actually 
test all the possible 2 n spin assignments. It is very easy to prove that there is a consistent assignment if and 
only if every undirected loop in the graph G has a net positive sign, that is to say, an even number, possibly 
zero, of negative arrows. Equivalently, any two (undirected) paths between two nodes must have the same 
net sign. By undirected loops or paths, we mean that one is allowed to transverse an edge either forward or 
backward. A proof of this condition is as follows. If a consistent assignment exists, then, for any undirected 
loop Vi x , . . . ,Vi k = Vi x starting from and ending at the node v^, inductively one has that: 

a i\ = Qh,ik-i Qik-i,ik-2 • ■ ■ > Qh,h a h 
K 



where = Jij if we are transversing the edge from Vj to Vi, or Qij = Jji if we are transversing backward 
the edge from Vj to V{. This implies (divide by a^) that the product of the edge signs is positive. Conversely, 
if any two paths between nodes have the same parity, and the graph is connected, pick node v\ and label it 
"+" and then assign to every other node V{ the parity of a path connecting v\ and V{. (If the graph is not 
connected, do this construction on each component separately.) 

This positive-loop property, in turn, can be checked with a fast dynamic programming-like algorithm. For 
connected graphs, there can be at most two consistent assignments, each of which is the reverse (flip every 
spin) of the other. 

Monotone systems 

A dynamical system is said to be monotone if there exists at least one consistent spin assignment for its 
associated graph G. Monotone systems [37, 51, 52] were introduced by Hirsch, and constitute a class of 
dynamical systems for which a rich theory exists. (To be precise, we have only defined the subclass of systems 
that are monotone with respect to some orthant order. The notion of monotonicity can be defined with respect 
to more general orders.) 

Consistent response to perturbations 

Monotonicity reflects the fact that a system responds consistently to perturbations on its components. Let 
us now discuss this property in informal terms. We view the nodes of the graph shown in Figure IU(a) 
as corresponding to variables in the system, which quantify the concentrations of chemical species such as 
activated receptors, proteins, transcription factors, and so forth. Suppose that a perturbation, for example 



Figure 4: (a) and (b) graph and consistent assignment, (c) and (d) no possible consistent assignments 

due to the external activation of a receptor represented by node 1, instantaneously increases the value of 
the concentration of this species. We represent this increase by an up-arrow inserted into that node, as 
in Figure [4](b) . The effect on the other nodes is then completely predictable from the graph. The species 
associated to node 2 will decrease, because of the inhibiting character of the connection from 1 to 2, and the 
species associated to node 3 will increase (activating effect). Where monotonicity plays a role is in insuring 
that the concentration of the species corresponding to node 4 will also increase. It increases both because it 
is activated by 3, which has increased, and because it is inhibited by 2, so that less of 2 implies a smaller 
inhibition effect. Algebraically, the following expression involving partial derivatives: 




dhdh 
8x3 dx\ 



+ 



dhdh 
dx2 dx\ 



(where fa gives the rate of change of the ith species, in the differential equation model) is guaranteed to 
be positive, since it is a sum of positive terms: (+)(+) + (—)(—)• Intuitively, the expression measures the 
sensitivity of the rate of change dx^/dt of the concentration of 4 with respect to perturbations in 1, with 
the two terms giving the contributions for each of the two alternative paths from node 1 to node 4. This 
unambiguous global effect holds true regardless of the actual values of parameters such as kinetic constants, 
and even the algebraic forms of reactions, and depends only on the signs of the entries of the Jacobian of 
f . Observe that the arrows shown in Figure S^b) provide a consistent spin-assignment for the graph, so the 
system is monotone. 

In contrast, consider next the graph in Figure S^c), where the edge from 1 to 2 is now positive. There are 
two paths from node 1 to node 4, one of which (through 3) is positive and the other of which (through 
2) is negative. Equivalently, the undirected loop 1,3,4,2,1 ("undirected" because the last two edges are 
transversed backward) has a net negative parity. Therefore, the loop test for consistency fails, so that there 
is no possible consistent spin-assignment for this graph, and therefore the corresponding dynamical system is 
not monotone. Reflecting this fact, the net effect of an increase in node 1 is ambiguous. It is impossible to 
conclude from the graphical information alone whether node 4 will be repressed (because of the path through 
2) or activated (because of the path through 3). There is no way to resolve this ambiguity unless equations 
and precise parameter values are assigned to the arrows. 

To take a concrete example, suppose that the equations for the system are as follows: 

dx\ n dx 2 dx 3 dx4 . 

"° ~dT = Xl ~dT = Xl = ^(hx 3 - k 2X2 ) , 

where the reaction constants k 2 and ks are two positive numbers. The initial conditions are taken to be 
xi(0) = £4(0) = 1, and 2:2(0) = £3(0) = 0, and we ask how the solution x 4 (t) will change when the initial 
value xi(0) is perturbed. With xi(0) = 1, the solution is x±(t) = e at I 2 , where a = k% — k 2 . On the other 
hand, if xi(0) is perturbed to a larger value, let us say x\ (0) = 2, then x 4 (t) = e at . This new value of x^(t) 
is larger than the original unperturbed value e at I 2 provided that a > 0, but it is smaller than it if, instead, 
a < 0. In other words, the sign of the sensitivity of X4 to a perturbation on x\ cannot be predicted from 
knowledge of the graph alone, but it depends on whether k 2 < A; 3 or k 2 > k%. Compare this with the monotone 
case, as in Figure a). A concrete example is obtained if we modify the x 2 equation to dx 2 /dt = 1/(1 +X\). 
Now the solutions are x 4 (t) = e^ 1 * and x^{t) = e^ 2t respectively, with (3\ = k%/2 — k 2 /4 and (5 2 = k% — k 2 /6, 
so we are guaranteed that X4 is larger in the perturbed case, a conclusion that holds true no matter what are 
the numerical values of the (positive) constants ki. 

The uncertainty associated to a graph like the one in Figure [4^c) might be undesirable in natural systems. 
Cells of the same type differ in concentrations of ATP, enzymes, and other chemicals, and this affects the values 
of model parameters, so two cells of the same type may well react differently to the same "stimulus" (increase 
in concentration of chemical 1). While such epigenetic diversity is sometimes desirable, it makes behavior less 
predictable and robust. From an evolutionary viewpoint, a "change in wiring" such as replacing the negative 
edge from 1 to 2 by a positive one (or, instead, perhaps introducing an additional inconsistent edge) could 
lead to unpredictable effects, and so the fitness of such a mutation may be harder to evaluate. In a monotone 
system, in contrast, a stimulus applied to a component is propagated in an unambiguous manner throughout 
the circuit, promoting a predictably consistent increase or consistent decrease in the concentrations of all 
other components. 

Similarly, consistency also applies to feedback loops. For example, consider the graph shown in Figure Hl[d). 
The negative feedback given by the inconsistent path 1,3,4,2,1 means that the instantaneous effect of an 
up-perturbation of node 1 feeds back into a negative effect on node 1, while a down-perturbation feeds back 
as a positive effect. In other words, the feedback loop acts against the perturbation. 
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Of course, negative feedback as well as inconsistent feedforward circuits are important components of biomolec- 
ular networks, playing a major role in homeostasis and in signal detection. The point being made here is that 
inconsistent networks may require a more delicate tuning in order to perform their functions. 

In rigorous mathematical terms, this predictability property can be formulated as Kamke 's Theorem. Suppose 
that S = {cij, i = 1, . . . , n} is a consistent spin assignment for the system graph G. Let x(t) be any solution of 
dx/dt = f(x). We wish to study how the solution z(t) arising from a perturbed initial condition z(0) = x(0)+A 
compares to the solution x(t). Specifically, suppose that a positive perturbation is performed at time t = 
on the ith coordinate, for some index i G {1, . . . ,n}: Zi(0) > Xj(0) and Zj(0) = for all j ^ i. For 

concreteness, let us assume that the perturbed node i has been labeled by a\ = +1. Then, Kamke's Theorem 
says the following: for each node that has the same parity (i.e., each index j such that <jj = +1), and for every 
future time t, Zj(t) > Xj(t). Similarly, for each node with opposite parity (erj = —1), and for every time t, 
Zj(t) < Xj{t). (Moreover, one or more of these inequalities must be strict.) This is the precise sense in which 
an up-perturbation of the species represented by node Vi unambiguously propagates into up- or down-behavior 
of all the other species. See [37] for a proof, and see [39] for generalizations to systems with external input 
and output channels. 

For difference equations (discrete time systems), once that that self- loops have been included in the graph G 
and the definition of consistency, Kamke's theorem also holds; in this case the proof is easy, by induction on 
time steps. 

Consistent graphs can be embedded into larger consistent ones, but inconsistent ones cannot. For example, 
consider the graph shown in Figure EJa). This graph admits no consistent spin assignment since the undirected 




Figure 5: (a) inconsistent, (b) consistent, (c) adding node to consistent network 



loop 1, 3, 4, 2, 1 has a net negative parity. Thus, there cannot be any consistent graph that includes this graph 
as a subgraph. Compare this with the graph shown in Figure EJ^b). Consistency of this graph may well 
represent consistency of a larger graph which involves a yet-undiscovered species, such as node 5 in Figure 0c). 
Alternatively, and from an "incremental design" viewpoint, this graph being consistent makes it possible to 
consistently add node 5 in the future. 

Removing the smallest number of edges so as to achieve consistency 

Let us call the consistency deficit (CD) of a graph G the smallest possible number of edges that should be 
removed from G in order that there remains a consistent graph, and, correspondingly, a monotone system. 

As an example, take the graph shown in Figure [6|a). For this graph, it suffices to remove just one edge, the 
diagonal positive one, so the CD is 1. (In this example, the solution is unique, in that no other single other 
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Figure 6: (a) inconsistent graph, (b) consistent subgraph, (c) one inconsistent edge 



edge would suffice, but for other graphs there are typically several alternative ways to achieve consistency 
with a minimal number of deletions.) 

After deleting the diagonal, a consistent spin assignment E is: o\ = 03 = 1 and 02 = 04 = —1, see FigureEt a )- 
(Another assigment is the one with all spins reversed: o\ = 02 = — 1 and 03 = 04 = 1.) If we now bring back 
the deleted edge, we see that in the original graph only the one edge from node 1 to node 4 is inconsistent for 
the spin assignment E (Figure Etc)). 

This example illustrates a general fact: minimizing the number of edges that must be removed so that there 
remains a consistent graph is equivalent to finding a spin assignment E for which the number of inconsistent 
edges (those for which JijGiGj = — 1) is minimized. 

Yet another rephrasing is as follows. For any spin assignment E, let A\ be the subset of nodes labeled +1, 
and let A—\ be the subset of nodes labeled —1. The set of all nodes is partitioned into A\ and A—\. (In 
Figure Et c ), we have A\ = {1,3} and A—\ = {2,4}.) Conversely, any partition of the set of nodes into two 
subsets can be thought of as a spin assignment. With this interpretation, a consistent spin assigment is the 
same as a partition of the node set into two subsets A\ and A_\ in such a manner that all edges between 
elements of A\ be positive, all edges between elements of A—\ be positive, and all edges between a node in A\ 
and a node in A-\ be negative, see FigJTJ More generally, computing the CD amounts to finding a partition 
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Figure 7: (a) Consistent graph; (b) partition into A\ and A^\ 



so that n\ + n-i + p is minimized, where m is the number of negative edges between nodes in A\, n_\ is the 
number of negative edges between nodes in A-i, and p is the number of positive edges between nodes in Ai 
and 

A very special case is when the graph has all of its edges labeled negative, that is, Jjj = — 1 for all i,j. Stated 
in the language of partitions, the CD problem amounts to searching for a partition such that n\ + n_i is 
minimized (as there are no positive edges, p = 0). Moreover, since there are no positive edges, n% + n_i is 
actually the total number of edges between any two nodes in A\ or in A-\. Thus, — (ni + n_i) is the 
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number of remaining edges, that is, the number of edges between nodes in A\ and A-\. Therefore, minimizing 
n\ + n„i is the same as maximizing N — (ni + This is precisely the standard "MAX-CUT" problem in 

computer science. 

As a matter of fact, not only is MAX-CUT a particular case, but, conversely, it is possible to reduce the CD 
problem to MAX-CUT by means of the following trick. For each edge labeled +1, say from Uj to Vj, delete 
the edge but insert a new node Wij, and two negative edges, one from vi to Wij and one from Wij to vf 

Vi — > vj ~> Vi H Wij H Vj . 

The enlarged graph has only negative edges, and it is easy to see that the minimal number of edges that have 
to be removed in order to achieve consistency is the same as the number of edges that would have had to be 
removed in the original graph. Unfortunately, the MAX-CUT problem is NP-hard. However, the paper [53] 
gave an approximation polynomial-time algorithm for the CD problem, guaranteed to solve the problem to 
within 87.9% of the optimum value, as an adaptation of the semi-definite programming relaxation approach 
to MAX-CUT based on Goemans and Williamson's work [54]. (Is not enough to simply apply the MAX-CUT 
algorithm to the enlarged graph obtained by the above trick, because the approximation bound is degraded 
by the additional edges, so the construction takes some care.) 

Relation to Ising spin-glass models 

Another interpretation of CD uses the language of statistical mechanics. An Ising spin- glass model is defined 
by a graph G together with an "interaction energy" Jij associated to each edge (in our conventions, J™ is 
associated to the edge from Vj to Vi). In binary models, Jij £ {1, —1}, as we have here. A spin-assignment £ 
is also called a (magnetic) "spin configuration." A "non-frustrated" spin-glass model is one for which there 
is a spin configuration for which every edge is consistent [55-57]. This is the same as a consistent assignment 
for the graph G in our terminology. Moreover, a spin configuration that maximizes the number of consistent 
edges is one for which the "free energy" (with no exterior magnetic field): 

H(E) = - JijO-jQ-j 

ij 

is minimized. This is because, if £ results in C(S) consistent edges, then H(Y,) = — C(£) + I(£) = T — 2C(S), 
where /(£) is the number of non-consistent edges for the assignment £ and T = C + 1 is the total number of 
edges; thus, minimizing H(Yi) is the same as maximizing C(£). A minimizing £ is called a "ground state". 
(A special case is that in which Jj,- = —1 for all edges, the "anti-ferromagnetic case". This is the same as the 
MAX-CUT problem.) 

Near-monotone systems may be "practically" monotone 

Obviously, there is no reason for large biochemical networks to be consistent, and they are not. However, when 
the number of inconsistencies in a biological interaction graph is small, it may well be the case that the network 
is in fact consistent in a practical sense. For example, a gene regulatory network represents all potential effects 
among genes. These effects are often mediated by proteins which themselves need to be activated in order 
to perform their function, and this activation will, in turn, be contingent on the "environmental" context: 
extracellular ligands, additional genes being expressed which may depend on cell type or developmental stage, 
and so forth. Thus, depending on the context, different subgraphs of the original graph describe the system, 
and these graphs may be individually consistent even if the entire graph, the union of all these subgraphs, 
is not. As an illustration, take the system in Figure |4](c) . Suppose that under environmental conditions A, 



the edge from 1 to 2 is not present, and under non-overlapping conditions B, the edge from 1 to 3 is not be 
present. Then, under either conditions, A or B, the graph is consistent, even though, formally speaking, the 
entire network is not consistent. 

The closer to consistent, the more likely that this phenomenon may occur. 
Some evidence suggesting near-monotonicity of natural networks 

Since consistency in biological networks may be desirable, one might conjecture that natural biological net- 
works tend to be consistent. As a way to test this hypothesis, the CD algorithm from [53] was run on the 
yeast Saccharomyces cerevisiae gene regulatory network from [10], downloaded from [58]. (The authors of [10] 
used the YPD database [59]. Nodes represent genes, and edges are directed from transcription factors, or 
protein complexes of transcription factors, into the genes regulated by them.) This network has 690 nodes 
and 1082 edges, of which 221 are negative and 861 are positive (we labeled the one "neutral" edge as positive; 
the conclusions do not change substantially if we label it negative instead, or if we delete this one edge). 
The algorithm in [53] provides a CD of 43. In other words, deleting a mere 4% of edges makes the network 
consistent. Also remarkable is the following fact. The original graph has 11 components: a large one of size 
664, one of size 5, three of size 3, and six of size 2. All of these components remain connected after edge 
deletion. The deleted edges are all from the largest component, and they are incident on a total of 65 nodes 
in this component. 

To better appreciate if a small CD might happen by chance, the algorithm was also run on random graphs 
having 690 nodes and 1082 edges (chosen uniformly), of which 221 edges (chosen uniformly) are negative. 
It was found that, for such random graphs, about 12.6% (136.6 ± 5) of edges have to be removed in order 
to achieve consistency. (To analyze the scaling of this estimate, we generated random graphs with N nodes 
and 1.57iV edges of which 0.32iV are negative. We found that for N > 10, approximately A/5 nodes must 
be removed, thus confirming the result for N = 690.) Thus, the CD of the biological network is roughly 
15 standard deviations away from the mean for random graphs. Both topology (i.e., the underlying graph) 
and actual signs of edges contribute to this near-consistency of the yeast network. To justify this assertion, 
the following numerical experiment was performed. We randomly changed the signs of 50 positive and 50 
negative edges, thus obtaining a network that has the same number of positive and negative edges, and the 
same underlying graph, as the original yeast network, but with 100 edges, picked randomly, having different 
signs. Now, one needs 8.2% (88.3 ± 7.1) deletions, an amount in-between that obtained for the original yeast 
network and the one obtained for random graphs. Changing more signs, 100 positives and 100 negatives, 
leads to a less consistent network, with 115.4 ± 4.0 required deletions, or about 10.7% of the original edges, 
although still not as many as for a random network. 

Decomposing systems into monotone components 

Another motivation for the study of near-monotone systems is from decomposition-based methods for the 
analysis of systems that are interconnections of monotone subsystems. One may "pull out" inconsistent 
connections among monotone components, in such a manner that the original system can then be viewed as a 
"negative feedback" loop around an otherwise consistent system (Figure [8]). In this interpretation, the number 
of interconnections among monotone components corresponds to the number of variables being fed-back. 

For example, let us take the graph shown in Figure E)(a). The procedure of dropping the diagonal edge and 
seeing it instead as an external feedback loop can be modeled as follows. The original differential equation 
dxi/dt = fi(xi,X2,X3,X4) is replaced by the equation dx\jdt = f\{x\, X2, £3, u), where the symbol it, which 
represents an external input signal, is inserted instead of the state variable X4. The consistent system in 
Figure [8] includes the remaining four edges, and the "negative" feedback (negative in the sense that it is 
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Figure 8: Pulling-out inconsistent connections 



inconsistent with the rest of the system) is the connection from x±, seen as an "output" variable, back into 
the input channel represented by u. The closed-loop system obtained by using this feedback is the original 
system, now viewed as a negative feedback around the consistent system in Figure EJb). 

Generally speaking, the decomposition techniques in [18,19,39,41,42,60-66] are most useful if the feedback 
loop involves few variables. This is equivalent to asking that the graph G associated to the system be close 
to consistent, in the sense of the CD of G being small. This view of systems as monotone systems — which 
have strong stability properties, as discussed next, — with negative-feedback regulatory loops around them is 
very appealing from a control engineering perspective as well. 

2.0.1 Dynamical behavior of monotone systems 

Continuous-time monotone systems have convergent behavior. For example, they cannot admit any possible 
stable oscillations [38,67,68]. When there is only one steady state, a theorem of Dancer [69] shows -under 
mild assumptions regarding possible constraints on the values of the variables, and boundedness of solutions- 
that every solution converges to this unique steady state (monostability) . When, instead, there are multiple 
steady-states, the Hirsch Generic Convergence Theorem [37,38,51,52] is the fundamental result. A strongly 
monotone system is one for which the an initial perturbation £j(0) > Xj(0) on the concentration of any 
species propagates as a strict up or down perturbation: Zj(t) > Xj(t) for all £ > and all indices j for which 
o~j = o~i, and Zj(t) < Xj(t) for all t > and all j for which Oj = —o~i. Observe that this requirement is 
stronger (hence the terminology) than merely weak inequalities: Zj(t) > Xj(t) or zj(t) < Xj(t) respectively 
as in Kamke's Theorem. A sufficient condition for strong monotonicity is that the Jacobian matrices must 
be irreducible for all x, which basically amounts to asking that the graph G must be strongly connected and 
that every non-identically zero Jacobian entry be everywhere nonzero. Even though they may have arbitrarily 
large dimensionality, monotone systems behave in many ways like one-dimensional systems: Hirsch's Theorem 
asserts that generic bounded solutions of strongly monotone differential equation systems must converge to the 
set of steady states. ("Generic" means "every solution except for a measure-zero set of initial conditions.") In 
particular, no "chaotic" or other "strange" dynamics can occur. For discrete-time strongly monotone systems, 
generically also stable oscillations are allowed besides convergence to equilibria, but no more complicated 
behavior. 

The ordered behavior of monotone systems is robust with respect to spatial localization effects as well as 
signaling delays (such as those arising from transport, transcription, or translation). Moreover, their stability 
character does not change much if some inconsistent connections are inserted, but only provided that these 
added connections are weak ("small gain theorem") or that they operate at a comparatively fast time scale [70]. 

The intuition behind the convergence results is easy to explain in the very special case of just two interacting 
species, described by a two-dimensional system with variables x(t) and y(t): 



dx 

~dt 

dy 

dt 



f(x,y) 
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A system like this is monotone if either (a) the species are mutually activating (or, as is said in mathematical 
biology, "cooperative"), (b) they are mutually inhibiting ("competitive"), or (c) either x does not affect y, 
y does not affect x, or neither affects the other. Let us discuss the mutually activating case (a). (Case (b) 
is similar, and case (c) is easy, since the systems are partially or totally decoupled.) We want to argue that 
there cannot be any periodic orbit. Suppose that there would be a periodic orbit in which the motion is 
counterclockwise, as shown in Figure (9|a). We then pick two points in this orbit with identical x coordinates, 
as indicated by (x,y) and (x,y') in Figure Eta). These points correspond to the concentrations at two times 

: (x,y'> 



foy) l(x',y) 

'(x,y) 

Figure 9: Impossible (a) counterclockwise and (b) clockwise periodic orbits in planar cooperative system, each 
drawn in the (x, y)-plane. 

to, ti, with x(to) = x(ti) and y(to) < y(ti). Since y(t\) is larger than y(to), x is at the same concentration, 
and the species are mutually activating, it follows that the rate of change in the concentration x should be 
comparatively larger at time t\ than at time to, that is, f(x,y') > f(x,y). However, this contradicts the fact 
that x(t) is increasing at time to (f(x,y) > 0) but is decreasing at time t\ (f(x,y') < 0). The contradiction 
means that there cannot be any counterclockwise-oriented curve. To show that there cannot be any clockwise- 
oriented curve, one may proceed by an entirely analogous argument, using two points (x, y) and (x', y) as 
in Figure [H^b). Of course, the power of monotone systems theory arises in the analysis of systems of higher 
dimension, since two-dimensional systems are easy to study by elementary phase plane methods. 

For general, non-monotone systems, on the other hand, no dynamical behavior, including chaos, can be 
mathematically ruled out. This is in spite of the fact that some features of non- monotone systems are 
commonly regarded as having a stabilizing effect. For example, negative feedback loops confer robustness 
with regard to certain types of structural as well as external perturbations [71-74]. However, and perhaps 
paradoxically, the behavior of non-monotone systems may also be very fragile: for instance, they can be 
destabilized by delays in negative feedback paths. Nonetheless, we conjecture that systems that are close to 
monotone must be better-behaved, generically, than those that are far from monotone. Preliminary evidence 
(unpublished) for this has been obtained from the analysis of random Boolean networks, at least for discrete 
analogs of the continuous system, but the work is not yet definitive. 

Directed cycles 

Intuition suggests that somewhat less than monotonicity should suffice for guaranteeing that no chaotic 
behavior may arise, or even that no stable limit cycles exist. Indeed, monotonicity amounts to requiring 
that no undirected negative-parity cycles be present in the graph, but a weaker condition, that no directed 
negative parity cycles exist, should be sufficient to insure these properties. For a strongly connected graph, 
the property that no directed negative cycles exist is equivalent to the property that no undirected negative 
cycles exist, because the same proof as given earlier, but applied to directed paths, insures that a consistent 
spin assignment exists (and hence there cannot be any undirected negative cycles). However, for non-strongly 
connected graphs, the properties are not the same. On the other hand, every graph can be decomposed as 
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a cascade of graphs that are strongly connected. This means (aside from some technicalities having to do 
with Jacobian entries being not identically zero but vanishing on large sets) that systems having no directed 
negative cycles can be written as a cascade of strongly monotone systems. Therefore, it is natural to conjecture 
that such cascades have nice dynamical properties. Indeed, under appropriate technical conditions for the 
systems in the cascade, one may recursively prove convergence to equilibria in each component, appealing to 
the theory of asymptotically autonomous systems [75] and thus one may conclude global convergence of the 
entire system [76,77]. For example, a cascade of the form dx/dt = f(x), dy/dt = g(x,y) where the x system 
is monotone and where the system dy/dt = g(xo,y) is monotone for each fixed xq, cannot have any attractive 
periodic orbits (except equilibria). This is because the projection of such an orbit on the first system must be 
a point xq, and hence the orbit must have the form (xo,y(t)). Therefore, it is an attractive periodic orbit of 
dy/dt = g(xo,y), and by monotonicity of this latter system we conclude that y(t) = a constant as well. The 
argument generalizes to any cascade, by an inductive argument. Also, chaotic attractors cannot exist [78]. 

The condition of having no directed negative cycles is the weakest one that can be given strictly on the basis 
of the graph G, because for any graph G with a negative feedback loop there is a system with graph G which 
admits stable periodic orbits. (First find a limit cycle for the loop, and then use a small perturbation to define 
a system with nonzero entries as needed, which will still have a limit cycle.) 

Positive feedback and stability 

The strong global convergence properties of monotone systems mentioned above would seemingly contra- 
dict the fact that positive feedback, which tends to increase the direction of perturbations, is allowed in 
monotone systems, but negative feedback, which tends to stabilize systems, is not. One explanation for this 
apparent paradox is that the main theorems in monotone systems theory only guarantee that bounded solu- 
tions converge, but they do not make any assertions about unbounded solutions. For example, the system 
dx/dt = —x + x 2 has the property that every solution starting at an x(0) > 1 is unbounded, diverging to 
+oo, a fact which does not contradict its monotonicity (every one-dimensional system is monotone). This is 
not as important a restriction as it may seem, because for biochemical systems it is often the case that all 
trajectories must remain bounded, due to conservation of mass and other constraints. A second explanation 
is that negative self-loops are not ruled out in monotone systems, and such loops, which represent degradation 
or decay diagonal terms, help insure stability. 

Intuition on why negative self-loops do not affect monotonicity 

In the definition of the graph associated to a continuous-time system, self-loops (diagonal terms in the Jacobian 
of the vector field /) were ignored. The theory (Kamke's condition) does not require self-loop information 
in order to guarantee monotonicity. Intuitively, the reason for this is that a larger initial value for a variable 
Xi implies a larger value for this variable, at least for short enough time periods, independently of the sign 
of the partial derivative dfi/dxi (continuity of flow with respect to initial conditions). For example, consider 
a degradation equation dp/dt = —p, for the concentration p(t) of a protein P. At any time t, we have that 
p{t) = e~*p(0), where p(0) is the initial concentration. The concentration p(t) is positively proportional to 
p(0), even though the partial derivative = —1 is negative. Note that, in contrast, for a difference 

equation, a jump may occur: for instance the iteration p(t + 1) = —p(t) has the property that the order of 
two elements is reversed at each time step. Thus, for difference equations, diagonal terms matter. 
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Multiple time scale analysis may make systems monotone 



A system may fail to be monotone due to the effect of negative regulatory loops that operate at a faster 
time scale than monotone subsystems. In such a case, sometimes an approximate but monotone model may 
be obtained, by collapsing negative loops into self-loops. Mathematically: a non-monotone system might 
be a singular perturbation of a monotone system. A trivial linear example that illustrates this point is 
dx/dt = —x—y, edy/dt = —y+x, with e>0. This system is not monotone (with respect to any orthant cone). 
On the other hand, for e <C 1, the fast variable y tracks x, so the slow dynamics is well-approximated by 
dx/dt = — 2x (monotone, since every scalar system is). 

More generally, one may consider dx/dt = f(x,y), edy/dt = g(x,y) such that the fast system dy/dt = g(x,y) 
has a unique globally asymptotically stable steady state y = h{x) for each x (and possibly a mild input to state 
stability requirement, as with the special case edy/dt = —y + h(x)), and the slow system dx/dt = f(x, h(x)) is 
(strongly) monotone. Then one may expect that the original system inherits global convergence properties, at 
least for all e>0 small enough. The paper [79] employs tools from geometric invariant manifold theory [80,81], 
taking advantage of the existence of a manifold M e invariant for the dynamics, which attracts all near-enough 
solutions, and with an asymptotic phase property. The system restricted to the invariant manifold M £ is 
a regular perturbation of the fast (e = 0) system, and hence inherits strong monotonicity properties. So, 
solutions in the manifold will be generally well-behaved, and asymptotic phase implies that solutions track 
solutions in M e , and hence also converge to equilibria if solutions on M e do. However, the technical details 
are delicate, because strong monotonicity only guarantees generic convergence, and one must show that the 
generic tracking solutions start from the "good" set of initial conditions, for generic solutions of the large 
system. 

Discrete-time systems 

As discussed, for autonomous differential equations monotonicity implies that stable periodic behaviors will 
not be observed, and moreover, under certain technical assumptions, all trajectories must converge to steady 
states. This is not exactly true for difference equation models, but a variant does hold: for discrete-time 
monotone systems, trajectories must converge to either steady states or periodic orbits. In general, even the 
simplest difference equations may exhibit arbitrarily complicated (chaotic) behavior, as shown by the logistic 
iteration in one dimension x(t+l) = kx(t)(l—x(t)) for appropriate values of the parameter k [82]. However, for 
monotone difference equations, a close analog of Hirsch's Generic Convergence Theorem is known. Specifically, 
suppose that the equations are point-dissipative, meaning that all solutions converge to a bounded set [83], 
and that the system is strongly monotone, in the sense that the Jacobian matrix (dfi/dxj) is irreducible at 
all states. Then, a result of Terescak and coworkers [84-87] shows that there is a positive integer m such 
that generic solutions (in an appropriate sense of genericity) converge to periodic orbits with period at most 
m. Results also exist under less than strong monotonicity, just as in the continuous case, for example when 
steady states are unique [69]. 

Difference equations allow one to study wider classes of systems. As a simple example, consider the nondi- 
mensionalized harmonic oscillator (idealized mass-spring system with no damping), which has equations 

dx 

dt = V 
dy 

— = —x . 
dt 

(For this example, we allow variables to be negative; these variables might indicate deviations of concentrations 
from some reference value.) This system is not monotone, since v\ — ► i>2 is negative and v\ — > V2 is positive, so 

1 K 



that its graph has a negative loop. On the other hand, suppose that one looks at this system every At seconds, 
where At = ir. The discrete-time system that results (using a superscript + to indicate time-stepping) is now: 

x + = -y 
y + = —x 

(this is obtained by solving the differential equation on an interval of length tt). This system is monotone 
(both vi — > V2 and V\ — > V2 are negative). Every trajectory of this discrete system is, in fact, of period two: 
( x o> 2/o) — ¥ ( — 2/0) — ^o) ~~ * ( x o> 2/0 ) This periodic property for the difference equation corresponds to the 

period-27r behavior of the original differential equation. 



Oscillatory behaviors 



Stable periodic behaviors are ruled-out in autonomous monotone continuous-time systems. However, stable 
periodic orbits may arise through various external mechanisms. Three examples are (1) inhibitory negative 
feedback from some species into others in a monotone monostable system, (2) the generation of relaxation 
oscillations from a hysteresis parametric behavior by negative feedback on parameters by species in a monotone 
system, and (3) entrainment of external periodic signals. These general mechanisms are classical and well- 
understood for simple, one or two-dimensional, dynamics, and they may be generalized to the case where the 
underlying system is higher-dimensional but monotone. 



Embeddings in monotone systems 

As observed by Gouze [88,89], any n-dimensional system can be viewed as a subsystem of a 2n-dimensional 
monotone system. The mathematical trick is to first duplicate every variable (species), introducing a "dual" 
species, and then to replace every inconsistent edge by an edge connecting the source species and the "dual" of 
its target (and vice- versa). The construction is illustrated in FigllOl At first, this embedding result may seem 




Figure 10: (a) duplicated inconsistent graph, (b) replacing arrows and consistent assignment 



paradoxical, since all monotone (or strongly monotone) systems have especially nice dynamical behaviors, 
such as not having any attractive periodic orbits or chaotic attractors, and of course non-monotone systems 
may admit such behaviors. However, there is no contradiction. A non-monotone subsystem of a monotone 
system may well have, say, a chaotic attractor or a stable periodic orbit: it is just that this attractor or 
orbit will be unstable when seen as a subset of the extended (2n-dimensional) state space. Not only there is 
no contradiction, but a classical construction of Smale [90] shows that indeed any possible dynamics can be 
embedded in a larger monotone system. More generally, the Hirsch Generic Covergence Theorem guarantees 
convergence to equilibria from almost every initial condition; applied to the above construction, in general the 
exceptional set of initial conditions would include the "thin" set corresponding to the embedded subsystem. 
Yet, one may ask what happens for example if the larger 2n-system has a unique equilibrium. In that case, 
it is known [69] that every trajectory converges (not merely generic ones), so, in particular, the embedded 
subsystem must also be "well-behaved". Thus, systems that may be embedded by the above trick into 
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monotone systems with unique equilibria will have global convergence to equilibria. This property amounts 
to the "small gain theorem" shown in [39] , see [42] for a discussion and further results using this embedding 
idea. 

Discrete systems 

We remark that one may also study difference equations for which the state components are only allowed to 
take values out of a finite set. For example, in Boolean models of biological networks, each variable Xi(t) can 
only attain two values (0/1 or "on/off"). These values represent whether the ith gene is being expressed, or 
the concentration of the ith protein is above certain threshold, at time t. When detailed information on kinetic 
rates of protein-DNA or protein-protein interactions is lacking, and especially if regulatory relationships are 
strongly sigmoidal, such models are useful in theoretical analysis, because they serve to focus attention on 
the basic dynamical characteristics while ignoring specifics of reaction mechanisms [91-95]). 

For difference equations over finite sets, such as Boolean systems, it is quite clear that all trajectories must 
either settle into equilibria or to periodic orbits, whether the system is monotone or not. However, cycles in 
discrete systems may be arbitrarily long and these might be seen as "chaotic" motions. Monotone systems, 
while also settling into steady states or periodic orbits, have generally shorter cycles. This is because periodic 
orbits must be anti-chains, i.e. no two different states can be compared; see [37,96]. For example, consider a 
discrete-time system in which species concentrations are quantized to the k values {0, . . . , k— 1}; we interpret 
monotonicity with respect to the partial order: (a±, . . . , a n ) < (bi, . . . , b n ) if every coordinate aj < 6j. For 
non-monotone systems, orbits can have as many as k n states. On the other hand, monotone systems cannot 
have orbits of size more than the width (size of largest antichains) of P = {0, . . . , k— l} n , which can be 
interpreted as the set of multisubsets of an n-element set, or equivalently as the set of divisors of a number 
of the form [p\P2 ■ ■ -Pn)^~^ where the p^s are distinct primes. The width of P is the number of possible 
vectors . . . , i n ) such that ^ ij = \kn/2\ and each ij E {0, . . . , k—1}. This is a generalization of Sperner's 
Theorem; see [97]. For example, for n = 2, periodic orbits in a monotone system evolving on {0, . . . , k—1} 2 
cannot have length larger than k, while non-monotone systems on {0, ... , k—1} 2 can have a periodic orbit of 
period k 2 . As another example, arbitrary Boolean systems (i.e., the state space is {0, l} n ) can have orbits of 
period up to 2 ra , but monotone systems cannot have orbits of size larger than (i n / 2 j) ~ 2 n -^/2/(n7r). These 
are all classical facts in Boolean circuit design [96]. It is worth pointing out that any anti-chain Pq can be 
seen as a periodic orbit of a monotone system. This is proved as follows: we enumerate the elements of Po as 
xi, . . . ,xe, and define f(xi) = Xj_i for all i modulo i. Then, / can be extended to all elements of the state 
space by defining f(x) = (0, . . . , 0) for every x which has the property that x < x\ for some X{ G Po an d 
f(x) = (k—1, . . . , k—1) for every x which is not < Xj for any Xj € Po. It is easy to see that this is a monotone 
map [96,98]. 

While on the subject of discrete and in particular Boolean systems, we mention a puzzling fact: any Boolean 
function may be implemented by using just two inverters, with all other gates being monotone. In other 
words, a circuit computing any Boolean rule whatsoever may be built so that its "consistency deficit" is just 
two. This is a well-known fact in circuit design [96,99]. Here is one solution, from [100]. One first shows how 
to implement the Boolean function that takes as inputs three bits A, B, C and outputs the vector of three 
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complements (notA, not-B, notC), by using this sequence of operations: 



2or3ones 


= (A A B) V (A A C) V (B A C) 


Oorlones 


= not(2or3ones) 


lone 


= Oorlones A (A V B V C) 


lor3ones 


= lone V(AABAC) 


0or2ones 


= not(lor3ones) 


Oones 


= 0or2ones A Oorlones 


2ones 


= 0or2ones A 2or3ones 


notv4 


= Oones V (lone A(BV C)) V (2ones A (B A C)) 


notB 


= Oones V (lone A (A V C)) V (2ones A (A A C)) 


notC 


= Oones V (lone A (Ay B)) V (2ones A (A A B)) 



(the node labeled "2or3ones" computes the Boolean function "the input has exactly 2 or 3 ones" and so forth) . 
Note that only two inverters have been used. If we now want to invert four bits A, B, C, D, we build the above 
circuit, but we implement the inversion of the three bits (2or3ones, lor3ones, D) by a subciruit with only two 
inverters. With a similar recursive construction, one may invert an arbitrary number of bits, using just two 
inverters. 

(continued in Part 2) 
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